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FOREWORD 


An  understanding  of  the  mathematical  theory  of  finite  elements 
becomes  necessary  when  conducting  research  into  advanced  techniques  for 
finite  element  application.  Such  an  area  of  research  exists  in  the 
development  of  contact/impact,  finite  element  formulations  presently 
being  pursued  at  the  Civil  Engineering  Laboratory  (CEL).  The  work 
reported  herein  supports  that  project. 

Much  of  the  work  presented  was  accomplished  while  the  author  was  on 
leave  for  one  academic  year  at  the  University  of  California,  Santa 
Barbara,  under  the  auspices  of  the  CEL  Fellowship  Program.  There, 
research  was  begun  on  the  development  of  contact/impact,  finite  element 
formulation,  but  soon  it  became  evident  that  a study  of  the  mathematical 
theory  of  finite  element  was  first  necessary  for  a greater  appreciation 
of  element  performance,  convergence,  and,  in  general,  understanding  why 
the  finite  element  method  works  as  it  does. 

An  important  outcome  of  the  study  is  the  belief  that  some  important 
concepts  of  the  mathematical  theory  of  finite  element  can  be  presented 
to  many  practitioners  without  necessity  for  reference  to  higher 
mathematics  because  these  concepts  can  be  cast  in  the  familiar 
geometrical  notions  of  Euclidean  space.  This  report  was  written  and 
presented  with  that  belief  foremost  in  mind. 
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INTRODUCTION 

The  purpose  of  this  report  is  to  provide  an  explicative  introduction 
to  the  mathematical  theory  of  the  finite  element  method.  To  really 
understand  the  finite  element  method  today,  it  is  necessary  to  know 
something  of  its  mathematical  foundations  as  they  relate  to  accuracy, 
convergence,  valid  shape  functions,  and  other  mathematically  based 
aspects  which  are  associated  with  the  method.  Results  from  research 
into  these  questions  are  beginning  to  emerge  and  will  soon  impact  the 
practitioner  with  greater  frequency.  The  mathematical  theory  that  has 
been  established  to  date  is  not  complete,  but  it  is  sufficient  to  under- 
stand both  why  the  finite  element  method  works  and  why  it  is  efficient 
in  the  numerical  solution  of  engineering  boundary  value  problems. 

The  major  obstacle  in  understanding  is  that  the  language  must  of 
necessity  involve  a minimum  of  functional  analysis.  This  is  a branch  of 
mathematics  with  which  most  engineers  are  unfamiliar.  To  that  end,  the 
report  begins  with  a section  on  prerequisite  functional  analysis.  The 
intent  is  to  present  only  enough  information  to  enable  most  engineers  to 
interpret  the  necessary  functional  analysis  concepts  within  familiar, 
geometrical  frameworks.  These  concepts  are  simple  when  viewed  as  gener- 
alizations of  the  ordinary  three-dimensional  Euclidian  space  notions  of 
distance,  dot  products,  Pythagorean  theorem,  orthogonal  projection,  and 
other  concepts. 

Mathematical  respectability  began  to  come  to  the  finite  element 
method  when  the  unknown  nodal  point  variables  were  recognized  as  the 
unknown  coefficients  of  the  classical  Rayleigh-Ritz  method.  Thereafter, 
many  of  the  initial  attempts  at  clarifying  the  mathematical  theory  were 
presented  in  the  framework  of  the  Rayleigh-Ritz  procedure.  But  soon  it 
was  realized  that  problems  were  being  routinely  solved  by  the  finite 


element  method  that  ronid  not  also  he  formulated  in  a classical  Rayleigli- 
Ritz  way.  The  Rayleigh-Ritz  method  requires  the  existence  of  a varia- 
tional formulation  of  a problem  prior  to  its  use  in  the  problem's  solu- 
tion. Yet,  many  problems  being  solved  had  no  such  corresponding  formu- 
lation. 

For  example,  in  structural  mechanics  most  problems  were  being 
formulated  on  the  basis  of  the  principle  of  virtual  work  which  is  in 
fact  more  general  than  variational  formulations.  Therefore,  the  mathe- 
matical theory  could  not  be  completely  developed  from  a classical 
Rayleigh-Ritz  viewpoint. 

From  then  on,  the  method's  development  was  no  longer  the  sole 
province  of  engineers  who  had  originally  developed  the  method  and  who 
indeed  coined  the  phrase  "finite  element."  Investigators  who  were 
familiar  with  variational  methods  atid  who  also  knew  of  a more  general 
method  - the  classical  Galerkin  method  - soon  saw  the  advantage  of 
thinking  in  terms  of  that  method  when  seeking  fundamental  knowledge  on 
the  finite  element  method.  Today,  the  finite  element  method  is  thought 
to  be  founded  in  an  extension  of  the  classical  R i t z-Ga lerk in  theory. 

The  finite  element  method,  as  a result,  is  now  applicable  to  problems 
in  fluid  mechanics,  heat  conduction,  electrostatics,  and  other  areas, 
besides  the  traditional  problems  in  elasticity  and  structural  mechanics. 

In  general,  whenever  the  solution  of  a differential  equation  is  sought 
within  a region  bounded  by  geometrically  complex  or  "real"  shapes,  the 
finite  element  method  is  often  more  advantageous  than  other  methods. 

The  principal  alternative  method  for  the  numerical  solution  of  boundary 
value  problems  is  the  finite  difference  method. 


SOME  PREREQUISITES  FROM  FUNCTIONAL  ANALYSIS 

Functional  analysis  is  the  study  of  vector  spaces  resulting  from  a 
merger  of  geometry,  linear  algebra,  and  analysis.  It  serves  as  a basis 
for  aspects  of  several  important  branches  of  applied  mathematics,  including 


Fourier  series,  integral  and  differential  equations,  numerical  analysis, 
and  any  field  where  linearity  plays  a key  role.  Its  appeal  as  a unifying 
discipline  stems  primarily  from  its  geometric  character.  Most  of  the 
principal  results  in  functional  analysis  are  expressed  as  abstractions 
of  intuitive  geometric  properties  of  ordinary  three-dimensional  space. 

Within  the  last  10  years,  the  finite  element  method,  which  is  about 
20  years  old,  has  been  given  mathematical  respectability.  Gradually  the 
method  has  been  recognized  as  an  extension  of  the  Rayleigh-Ritz-Galerkin 
technique.  To  stury  the  mathematical  foundation  of  finite  element  it  is 
first  necessary  to  become  comfortable  with  a certain  number  of  concepts 
from  functional  analysis.  Examples  would  include  the  concept  of  a norm 
and  its  interpretation  as  a natural  measure  of  strain  energy,  and  the 
identification  of  a Hilbert  space  as  the  collection  of  admissible  func- 
tions in  a physically  derived  variational  principle. 

The  aim  here  is  to  present  a minimum  amount  of  functional  analysis 
believed  necessary  to  comprehend  the  mathematical  foundations  of  the 
finite  element  method.  We  begin  essentially  from  "ground  zero,"  i.e., 
with  the  definition  of  a vector  space  and  proceed  up  to  the  definition 
of  a Hilbert  space.  Each  succeeding  definition  builds  upon  the  preceding 
definitions.  Examples  are  given  freely  to  aid  the  comprehension  of  the 
def i n i t ion . 

Many  of  the  following  definitions  hav"  been  extracted  from  Luenberger. 
Though  the  author  does  not  address  the  subject  of  finite  element,  its 
treatment  of  functional  analysis  is  well-suited  to  an  introduct ioti  to 
the  mathematical  theory  of  finite  element. 

Linear  Vector  Space 

A linear  vector  space  X is  a set  of  elements  called  vectors  together 
with  two  operations 

(1)  Addition;  given  x,y  6 X 

then  X + y e X 


*David  G.  Luenberger.  Optimization  by  vector  space  methods.  New  York, 
N.Y.,  John  Wiley  and  Sons,  Inc.,  1969. 


(2)  Scalar  Hul I ipl ical ion  : 

given  X e X and  a any  scalar 
then  ax  G X 

The  set  X and  the  above  two  operations  are  assumed  to  satisfy  the  following 
seven  axioms. 

( 1 ) X + y = y + X 

(2)  (x  + y)  + z = X + (y  + z) 

(3)  X + 0 = X (all  X G X where  0 is  defined  as  a null  vector) 

(4) a(x+y)  - axtay 

(5) (a+P)x  = ax+px 

(6)  (ap)x  = a(px) 

(7)  Ox  = 0 

Some  examples  of  vector  spaces  are: 

A.  Set  of  real  numbers.  Addition  and  multiplication  are  defined 
in  the  usual  way.  The  real  number  zero  plays  the  role  of  the  null 
vector.  This  space  is  denoted  as  R*.  It  is  the  one-dimensional  real 
coordinate  line. 

B.  An  n-dimensional  real  coordinate  space  . 


X = (C, , ^2’  • • • ’ ^n^ 

0 = (0,  0,  . . .,  0) 

y = (ij , n2»  • • • • n^) 

L i \ 

i (ofx)^  = a^ 

0 C.  Collection  of  all  real -valued  continuous  functions  on  the 

interval  |a,b)  of  the  real  line,  C|a,b|. 


I 


0 


0 


= f(t) 


for  all  t on  ( a ,b  | 


(x  + y)(l)  = x(i)  + y(l) 


(ffx)t  = tfx(l) 

Example  C illustrates  that  a vector  space  may  be  composed  of  a set 
of  continuous  functions  and  therefore  should  not  be  thought  of  as  being 
composed  of  "vectors  per  se"  in  all  cases.  Indeed,  the  vector  spaces 
pertaining  to  finite  element  theory  are  composed  of  elements  that  are 
continuous  functions. 

Cartesian  Product 


Let  X and  Y be  vector  spaces  over  the  same  field  of  scalars.  Then 
the  Cartesian  Product  of  X and  Y,  denoted  X x Y,  consists  of  the  collec- 
tion of  ordered  pairs  (x,y)  with  x e X arid  y € Y.  The  Cartesian  product 
is  a vector  space  with  addition  and  multiplication  defined  as  follows: 

(1)  Addition:  (Xj.yj)  + (x2,yp)  = (x^  + ^2’^1  * ^2^ 

(2)  Multiplication:  a(x,y)  = (ax,wy) 

These  definitions  satisfy  the  seven  axioms  of  a linear  vector  spare. 
Subspace 

A nonempty  subset  M of  a vector  space  X is  called  a subspace  of  X 

if  every  vector  of  the  form  ax  + Py  is  in  M whenever  x and  y are  both  in 

M.  Some  examples  of  subspaces  and  subsets  are  shown  in  Figure  1 with 
2 

respect  to  the  R vector  space. 


5 


origin.  M 


Figure  1.  Subspaces  and  subsets. 


2 

Only  examples  (c)  and  (d)  are  subspaces  of  R . The  others  are  merely 

2 

subsets  of  R . 

A subspace  is  itself  a vector  space.  If  it  is  not  equal  to  the 

entire  space,  as  in  the  above  two  examples,  then  it  is  a proper  subspace 

3 

Examples  of  proper  subspaces  in  R include  both  a plane  and  a line 
running  through  the  origin. 

Norm 


A norm  is  a measure  of  the  size  of  an  element  belonging  to  a vector 
space  or  the  distance  between  two  elements  in  the  space  and  is  denoted 
by  jjxjl  or  ||  x - y ||  , respectively.  If  the  vector  space  X has  a norm 
defined  on  it,  it  becomes  a normed  vector  space.  This  means  that  there 


is  a real  valued  function  which  maps  each  element  x in  X into  a real 
number  11  x ||  called  the  norm  of  x.  The  norm  is  assumed  to  satisfy  the 


fol lowing 

three 

(1) 

Ml 

(2) 

II  "HI 

(3) 

||x  . y 

The 

first  a: 

^ OforallxGX, 


= 0 i f X = 0 


= o x 


for  all  scalars  or  and  all  x e X 


for  each  x,y  e X 


positive  measure.  If  the  measure  of  something  is  zero  (||  x ||  =0)  then 
that  something  must  be  zero  (x  = 0)  and  conversely  if  something  is  zero 
(x  = 0)  then  its  measure  should  be  zero  ( ||  ^||  - 0)  • 

The  second  axiom  states  that  x and  its  negative  -x  have  the  same 
measure,  and  that,  for  example,  the  measure  of  3x  is  three  times  the 
measure  of  x. 

The  third  axiom  is  the  triangular  inequality,  so-called  since  the 
sum  of  the  lengths  of  two  sides  of  a triangle  is  never  smaller  than  the 
length  of  the  third  side. 

A norm  is  merely  an  abstraction  of  our  usual  concept  of  length. 
Some  examples  of  norms  follow.  Some  of  the  following  real  valued  func- 
tions satisfy  the  three  axioms,  and  some  do  not. 


If  X = R : 


(a)  X 


n 

I cannot  be  used,  violates  axiom  (1) 

i=l  ^ 


(b)  x 


1 

i = l 


C, 


is  a valid  norm 


(c)  ||x 

(d)  llx 


" 2 

I 4-  cannot  be  used,  violates  axioms  (2)  and  (3) 
i=l  ^ 


max  t. 
i = l ,n  I ^ 


is  a valid  norm 


I 


7 


(e)  X 


If  X = C I a , b I : 
(f)  11x11  - 


is  a valid  and  very  useful  norm 
called  the  Euclidean  norm. 


max  |x(l)|  is  a valid  norm  and  is  the  largest  value 
aStSb  of  the  function  x(t)  on  the  interval  |a,b| 


(g)  X = / x(t)|dt  isa  valid  norm 


rilo 


X x(t)  dt 


This  is  a valid  norm  and  very  useful  in  finite  element  theory.  We  give 
it  the  special  designation  norm.  This  norm  can  be  generalized  to 
include  the  norms  of  the  functions  and  its  derivatives.  For  example: 


(i) 


f |lx(t)P  + lx'(t)|^  + |x"(t)P]dt\ 


This  norm  is  also  referred  to  as  the 


Convergence 


1 ti  a normed  linear  vector  space  an  infinite  sequence  of  vectors 
{x^!  is  said  to  be  convergent  to  a vector  x if  the  sequence  of  real 
numbers  jj  x - x^||  converges  to  zero.  In  this  case  we  write  x^  x. 


Trans  format  ion 

Let  X and  Y be  linear  vector  spaces,  and  let  D be  a subset  of  X.  A 
rule,  which  associates  with  every  element  x € D a corresponding  element 
y e Y,  is  called  a transformation  from  X to  Y with  domain  D.  This 


transformation  is  illustrated  in  Figure  2.  We  write  y = T(x).  The 
collection  of  all  vectors  y e Y for  which  there  is  an  x t D with  y = T(x) 
is  called  the  range  R of  T. 


Figure  2.  Transformation  between  two  vector  spaces. 


Funct ional 

A transformation  from  X into  the  space  of  real  numbers  (scalars) 
is  called  a functional  of  X,  usually  denoted  f(x),  g(x) , etc.  Examples 
of  functionals  include  the  following: 

(a)  f(x)  = ||x|| 

b 

(b)  f(x)  = / x(t)dt 

a 

The  idea  of  a functional  is  central  to  the  Rayleigh-Ritz  technique  and 
variational  principles. 

Cauchy  Sequence 

A sequence  {x^}  in  a normed  space  is  said  to  be  a Cauchy  sequence 
if  11  - x^ll  ->  0 as  m,n  ->  ®.  In  a normed  space  every  convergent 

sequence  is  a Cauchy  sequence  since. 


9 


However,  a Cauchy  sequetice  is  not  necessarily  convergent.  For 
example,  jl/n}  is  a Cauchy  sequence  in  the  space  X = R - 0 which  is  the 
real  line  R minus  the  origin.  But  this  Cauchy  sequence  is  not  convergent 
in  X since  {1/n}  converges  to  0 which  is  outside  the  space  X. 

More  generally,  if  we  lake  any  sequence  of  points  in  a vector  space 
which  converges  to  a limit  that  is  not  one  of  the  terms  in  the  sequence, 
and  then  delete  the  limit  from  the  vector  space,  we  get  a Cauchy  sequence 
which  is  not  convergent. 

Complete 

A normed  linear  vector  space  is  complete  if  every  Cauchy  sequence 
from  X has  a limit  in  X.  That  is,  X must  contain  the  limit  as  well  as 
the  members  of  the  sequence.  A complete  normed  linear  vector  space 
defines  a Banach  space. 

Generally,  it  is  advantageous  to  formulate  problems  in  such  a space 
because  we  can  easily  test  sequences  for  convergence  by  asking  whether 
they  are  Cauchy  or  not.  If  they  are,  we  have  assurance  that  the  seq<ience 
is  convergent  since  the  space  is  a completed  space.  We  do  not  have  to 
know,  a priori,  what  the  limit  is,  and  fortunately  so,  because  its  value 
is  often  the  object  of  the  problem. 

Consider  the  space  of  rational  numbers  R'.  Let  {x^}  be  a sequence 
of  rational  numbers  converging  to  ^Tl.  This  sequence  is  Cauchy,  but  R' 
does  not  contain  the  limit  element  V 2 ( V 2 is  an  irrational  number). 
Therefore  R'  is  not  complete. 

While  the  space  of  rational  numbers  is  not  complete,  the  space  of 
real  numbers  R is  complete.  Note  V 2 is  a real  number,  and  thus  R 
contains  the  sequence  and  the  limit. 


10 


It  is  apparent  by  now  that  there  may  be  more  than  one  norined  spare 
of  continuous  functions.  It  depends  only  on  the  definition  of  the  norm. 
To  see  this,  consider  the  space  of  continuous  functions  on  10,1).  We 
can  define  two  different  normed  spaces  as  follows: 

(a)  Let  X be  the  space  of  continuous  functions  on  10,I|  with  norm 
defined  as 

II  X II  = J I x(  t ) I dt 

II  II  Q I 

(b)  Let  X be  as  in  (a)  above  except  that  the  norm  is  defined  as 

II  X II  = max  I x(  t ) I 

' " O^t^l 

These  are  two  different  normetl  vector  spaces,  but  the  first  is  not 
complete  while  the  second  is. 

The  first  is  not  complete  because  a sequence  of  functions  in  X is 
defined  as  fol lows 


X (t) 

n 


0 

nt  - n/2  ♦ 1 
I 


0 ^ t $ 1/2  - 1/n 
1/2  - 1/n  S t ^ 1/2 
t ? 1/2 


A graphical  portrayal  of  this  sequence  and  its  convergence  (lack  of)  is 
shown  in  Figure  3. 


Figure  3.  Convergence  of  a sequence  of  functions. 


First  it  can  be  verified  that  {x  } is  Cauchy  since 

n •' 

II  X • X II  = 1 /2 1 1 /n  - 1 /tn|  ->  0 

II  n mil  I I 

However,  this  sequence  converges  to  a discontinuous  function  as  shown, 
and  therefore  does  not  converge  to  a member  of  X.  As  a result,  the 
normed  vector  space  is  not  complete. 

It  can  be  shown  that  every  Cauchy  sequence  in  the  second  normed 
vector  space  does  have  a limit  in  the  space.  The  main  difference  here 
is  that  the  example  sequence  is  not  Cauchy  with  respect  to  the  norm 
defining  the  space. 

Inner  Product 


On  the  Cartesian  product  space  X x X an  inner  product  is  defined  as 
follows.  Corresponding  to  each  pair  of  vectors  (x,y)  in  X the  inner 
product  <x,y>  of  x and  y is  a scalar.  The  inner  product  must  satisfy 
the  following  four  axioms  in  a real  vector  space  X. 
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<y.z> 


( 1 ) <x,y>  = <y,x> 

(2)  <x  + y,z>  = <x,2>  + 

(3)  <ffx,y>  = a<x,y> 

(4)  <^x,x>  ^ 0 and  <(x,x>  = 0 if  x = 0 

The  inner  product  is  a generalization  of  the  dot  product  in  two-  or 
three-dimensional  Euclidean  space. 

Hilhert  Space 

As  we  have  seen,  a Banach  space  is  a complete  normed  vector  space. 

A Hilbert  space  is  a Banach  space  plus  an  inner  product  which  defines 
the  norm;  i.e.,  ||x||  = <(x,x)>'^^. 

The  key  concept  involved  with  a Hilhert  space  is  orthogonality. 

This  concept  is  not  generally  available  in  other  spaces.  Two  vectors  in 
a Hilbert  space  are  defined  as  orthogonal  if  their  inner  product  is 
zero.  The  methods  of  Ritz,  Galerkin,  and,  of  course,  finite  element 
have  natural  settings  in  the  framework  of  Hilbert  spaces,  as  do  the 
concepts  of  optimization,  Fourier  series,  least-squares  minimization, 
and  ortho-normal  bases.  Hilbert  spaces  then  provide  a unifying  foundation 
for  many  areas  of  engineering. 

Projection  Theorem 


The  shortest  distance  from  a point  to  a plane  is  a line  given  by 
the  perpendicular  from  the  point  to  the  plane.  This  obvious  and  intuitive 
result  can  be  generalized  to  the  problem  in  any  higher  space  (n-dimensiona 
Euclidean  space  or  a continuous  function  space)  of  finding  the  "shortest 
vector,"  as  measured  by  a valid  norm,  from  a point  to  a subspace.  The 
approach  seeks  to  find  that  vector  which  is  orthogonal  to  the  subspace 
This  is  a brief  introduction  to  the  projection  theorem. 
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Let  H be  a Hilbert  space  and  M a closed  subspace  of  H.  Corresponding 
to  any  vector  x e H,  there  is  a unique  vector  e M such  that 
II  X - m^ll  % II " '"11  "*  ^ Furthermore,  a necessary  and 

sufficient  condition  that  ni  be  the  unique  minimizing  vector  is  that 
X - ni^  be  orthogonal  to  M.  The  three-dimensional  version  of  the  projec- 
tion theorem  is  shown  below  in  Figure  4. 


Figure  4.  Projection  theorem  concept. 


SOME  FINITE  ELEMENT  CONCEPTS  FROM  MATHEMATICAL  THEORY 

If  one  is  not  interested  in  the  mathematical  questions  of  what 
constitutes  valid  shape  functions,  accuracy,  convergence,  and,  in  general, 
why  the  method  is  efficient  in  many  problems,  it  would  not  be  necessary 
to  know  little  beyond  the  basic  principles  of  virtual  work  and/or  equi- 
librium to  enable  successful  application  of  the  method.  Further,  the 
basic  reason  that  the  finite  element  method  is  successful  and  efficient 
is  that  the  polynomial  shape  functions  which  it  employs  are  both  compu- 
tationally efficient  and  good  mathematical  approximations.  The  method 
does  not  owe  its  success  to  physical  principles,  though  this  sometimes 


I 


I 


I 


I 


I 


I 


i 

I 

i 

f 

f 

* 

\ 


appears  to  be  the  case  since  these  principles  are  the  source  of  the 
all-important  governing  differential  or  integral  equation.  However,  the 
finite  element  method  allows  an  efficient  discretization  and  numerical 
solution  of  the  governing  equations  after  they  have  been  established. 

Before  the  ingenious  application  of  the  shape  functions  that  are  attrib- 
uted to  the  finite  element  method,  the  discretization  was  often  accom- 
plished with  classical  Ritz  functions.  The  discretization,  however, 
remains  fundamentally  a Ritz  process,  and  to  understand  the  finite 
element  method  and  the  reason  for  its  success,  it  is  necessary  to  follow 
through  the  development  of  the  Ritz  method. 

Approximate  Method  of  Rayleigh-Ri tz 

There  are  a number  of  references  on  the  Rayleigh-Ritz  method,  but  a 
book  by  Mikhlin*  is  frequently  noted  in  the  literature  of  finite  element. 

This  reference  was  used  in  the  preparation  of  the  following  material, 

I 

and  though  it  covers  a good  deal  more  than  Rayleigh-Ritz  it  still  is  j 

very  complete  on  the  subject.  | 

% 

Consider  solving  the  time-independent  differential  equation  \ 

i 

) 

L u = f (1)  j 

I 

in  some  domain  Q where  L is  some  differential  operator  which  acts  upon 
an  unknown  function  u to  yield  a known  function  f.  On  the  boundary  f of 
0,  u must  satisfy  certain  boundary  conditions.  The  differential  operator 
L is  linear  and  defined  for  a subspace  M of  a real  Hilbert  space  H.  If 
L contains  derivatives  up  to  the  mth  order,  then  M is  the  space  of  all 
continuous  functions  having  continuous  mth  order  derivatives.  The 
larger  space  H is  taken  as  the  space  of  all  functions  which  are  square 
summable  and  have  the  inner  product 

<u,v^  = J u V d n u,v  € H 
Q 

*S.  G.  Mikhlin.  Variational  methods  in  mathematical  physics.  Oxford, 

England,  Pergamon  Press,  1964. 
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This  merely  means  that  <u,v)>  < » and  therefore  the  inner  product  will 
exist.  (This  is  not  a very  restrictive  definition  at  all,  and  therefore 
it  is  useful  to  think  of  H as  being  very  large.)  Let  f also  be  a member 
of  H so  that  Equation  1 defines  a linear  transformation  or  mapping  as 
shown  below  in  Figure  5. 


Figure  5.  Linear  transformation  from  subspace  M to  space  H. 


Let  be  a subspare  of  M such  that  in  addition  to  the  requirements  of 
being  a member  of  M,  the  functions  u € must  also  satisfy  the  boundary 
conditions.  The  term  is  called  the  field  of  definition  of  operator 
L.  For  the  time  being  let  it  be  stated  that  operator  L must  be  positive 
definite;  that  is,  <(Lu,u>  ^ 0 for  all  u G where  the  equality  holds 
if,  and  only  if,  u = 0 (6  being  the  null  vector). 

With  the  above  definitions  established  we  state,  without  proof,  the 
following  equivalent  variational  formulation  of  the  boundary  value 
problem.  Finding  the  function  u*  which  yields  the  minimum  of  the  quadratic 
functiona 1 

F(u)  = <Lu,u>-  2<u,f>  (2) 

is  the  same  as  solving  the  governing  differential  Equation  1 for  u.  A 
specific  example  of  a variational  formulation,  common  to  structural 
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engineers,  is  the  solution  of  the  partial  differential  equation  of 
static  elasticity  theory  by,  instead  of  integrating  the  differential 
equation,  finding  the  minimum  of  the  potential  energy  functional  for  the 
elastic  body.  A more  concise  statement  of  the  equivalency  is  as  fol- 
lows. If,  in  addition  to  being  positive  definite,  L is  a symmetric 
(<Lu,v^  = ^Lv,u^)  operator,  then  i f Lu  = f is  to  have  a solution  in  M, 
this  solution  is  also  the  minimizing  function  for  Equation  2.  Con- 
versely, if  u“  minimizes  F(u)  then  u*  = u and  is  the  solution  to  Lu  = f. 

In  solving  the  linear  differentia]  equation  of  Equation  1 it  should 
be  noted  that  while  the  given  function  f can  be  any  function  so  long  as 
it  is  a member  of  the  Hilbert  space  H,  the  sought-after  function  u must 
be  a member  of  the  field  of  definition  of  the  linear  operator  L as  well 
as  a member  of  H.  Thus  if  L is  a fourth  order  operator,  u must  be  a 
continuous  function  having  a continuous  fourth  order  derivative  and  must 
satisfy  certain  boundary  conditions  to  be  a member  of  the  field  of 
definition  Dj^. 

It  can  therefore  occur  that  for  some  functions  f e H there  will  not 
exist  a function  u in  the  field  of  definition  that  will  satisfy  the 
differential  equation.  As  an  example  of  this  case,  consider  the  problem 
of  finding  the  deflection  of  a uniform  cantilever  beam  under  the  action 
of  a uniformly  distributed  load  q(x).  The  appropriate  equations  are 

Ely''''  = q(x)  X € |0,Ll 

y(o)  = y'(r)  = y"(L)  = y'"(L)  = 0 

The  differential  equation  is  derived  by  considering  the  equilibrium  of 
an  infinitesimal  length  of  the  beam  under  the  assumption  that  the  loading 
is  continuous  across  the  infinitesimal  length. 

In  this  case  the  operator  is 


i 

I 
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Its  field  of  definition  Dj^  is  defined  as  the  totality  of  those  functions 
defined  over  |0,Li  which  possess  continuous  fourth  derivatives  and  which 
satisfy  the  boundary  conditions  of  the  problem.  If  q(x)  is  continuous 
everywhere  in  |0,L)  there  will  exist  a solution  y in  Dj^.  But  if  q(x)  is 
discontinuous,  no  solution  can  be  found  in  since  y would  have  to 
possess  a discontinuous  fourth  derivative  to  satisfy  the  governing 
equation.  It  should  be  noted  that  a discontinuous  q(x)  is  an  entirely 
legitimate  possibility  because  it  would  be  square  summable  and  therefore 
a member  of  H.  In  a practical  sense  a discontinuous  q(x)  is  very  common. 

This  difficulty  "an  be  overcome  by  considering  limits  of  functions 
that  lie  in  Just  as  a discontinuous  load  q(x)  may  be  considered  as 

the  limit  of  a sequence  of  continuous  loads,  so  functions  with  discon- 
tinuous fourth  derivatives  are  introduced  that  are  the  limits  of  sequences 
of  functions  with  continuous  fourth  derivatives.  Extending  the  field  of 
definition  in  this  manner  can  always  be  accomplished  if  L is  a posi- 
tive bounded  below  operator  in  addition  to  being  symmetric  and  positive 

definite.  The  criterion  for  being  positive  bounded  below  is  <(Lu,u>  ^ y <^u,u)> 
2 

where  u € and  y any  positive  constant.  It  can  then  be  asserted 
that  among  this  new  set  of  functions  lies  the  solution  of  the  differen- 
tial equation  for  any  f G H. 

This  spatial  extension  creates  a new  Hilbert  space  with  an  inner 
product  |u,vl  where  u,v  G and  this  inner  product  is  sometimes  called 
the  energy  product.  It  is  defined  as  follows, 

I u , V 1 = I i m < Lii  , V )>  = < Lu  , v> 

n-*oo 

where  u and  v are  sequences  and  ii  ,v  G D,  . Note  that  through  an 
n n ^ n n L 

integration  by  parts  we  get  a new  operator  R as  follows, 

<Lu  ,v  > = f (Lu  • V )d0  = f (Ru  • Rv  ]dQ. 

" " 0 \ n \ " "/ 
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Thus , 

|u,v)  = 1 im  / (Ru  • Rv  ) dn  = 1 im  f (Rv  • Ru  ) df) 

A It  n n n 

n-x*  Q n-x»  n 

= 1 im  <(Rv  ,Ru  > = ( V ,u  1 , 

n tr 

n-x» 


and  the  energy  product  is  shown  to  be  synunet r i ca 1 . 

The  Hilbert  space  is  important  in  the  theory  of  finite  element. 
The  norm  in  is  the  energy  norm  and  is  defined  as 


^ / (Ru  • Ru) 


The  set  of  admissible  functions  in  is  given  by  considering  sequences 
of  functions,  say  u^,  that  satisfy 


1 im 
n ,m-x» 


0. 


This  implies  the  existence  of  an  element  u with  finite  norm  to  which  the 
sequence  u^  converges  in  the  mean.  This  set  of  functions  constitutes 


Those  functions  constituting  must  satisfy  all  the  boundary 
conditions.  But  some  functions  in  which  contains  often  do  not 
satisfy  all  the  boundary  conditions.  Those  boundary  conditions  which 
are  satisfied  by  functions  in  but  which  are  not  satisfied  by  functions 
in  H,  are  called  natural  boundary  conditions.  Those  boundary  conditions 

la 

which  remain,  or  those  boundary  conditions  which  are  satisfied  by  func- 
tions only  in  are  the  essential  (forced,  principal)  boundary  conditions. 
The  latter  are  required  to  assure  that  the  energy  norm  is  positive 
definite;  i.e.,  |j  u ||  = V Iu,u]  ^ 0. 
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Now  we  return  to  the  problem  of  minimizing  F(ij)  where 


F(n)  = lu.ul  - 2<u,f>  (2a) 

and  where  the  candidate  functions  u are  now  members  of  The  central 

result  in  going  from  Equation  2 to  2a  is  that  now  the  trial  functions  u 
need  satisfy  a fewer  number  of  boundary  conditions  i.e.,  the  essential 
boundary  conditions. 

Following  the  classical  Ritz  method  to  approximate  the  minimizing 
function  of  F(u)  we  select  a solution  of  the  form 


M 

u 


M 

1 

k=l 


(3) 


M 

where  0 are  the  Ritz  lunctions,  and  a are  the  unknown  Ritz  coefficients 

^ ^ M 

which  are  to  be  determined.  If  the  functions  4)|^,  k=l,2,...,M,  are  com- 
plete in  H.  then  convergence  to  the  correct  answer  u is  assured  in  the 

^ M 

sense  of  convergence  of  the  energy  norm.  The  (t)|^  are  complete  in  if 

for  any  v G it  is  possible  to  choose  c > 0 and  find  an  integer  N and 


constants  a 


2’ 


a„  such  that 
n 


V - u 


< e for  M > N 


Substituting  Equation  3 into  Equation  2a,  the  quadratic  functional 
becomes 


F(u  ) 


M 

I I 

k=l 


I a d.'*)  - 2<  I 

k=l  k=l 


"k  '>’k>'> 


(A) 
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or , 


Cl 

F(u  ) 


" M ^ M 

J L ( 2 a 2 a /'  dfi 

Q k=t  k=l 


- 2 J ( 2 a.  0")  f dQ 
Q k=  1 


To  find  the  a^^,  k=l,2,...,M,  corresponding  to  the  niiniimim  of  F(u  ),  we 
differentiate  Equation  4a  with  respect  to  a^  and  set  the  result  to  zero. 


J-  IL  (oM  2 a 0%  L ( 2 a 0^^)  0^  | dfi 

fi  V J / k=l  k=l  J 


2 / 0 f dQ  = 0 


M M 

K .kM.  < s o /.kM  f\ 


Because  the  energy  product  is  symmetrical,  this  equation  reduces  to 


I .kM  V r \ - rv 

10.  , 2 a 0 I - v0  ,f  > = 0 

J k=l  " J 


Finally,  Equation  6 is  rearranged  as  follows. 


J L (0^)  2 a.  0tJ  dQ  - <0^,f>  = 0 

Q J k=  1 ^ ^ J 


or. 


Z a (0^,  0^1  = j = 1,2,. ...M  (7) 

k=  I " J " J 

Thus,  the  end  result  of  the  Ritz  process  is  Equation  7 which  is  a linear 
system  of  M algebraic  equations  from  which  we  can  solve  for  the  a^^  of 
Equation  3. 

Advantage  of  Finite  Element  Shape  Functions 

Strang  and  Fix  provides  an  excellent  discussion  on  the  subject  of 
the  mathematical  theory  of  finite  element.*  The  authors,  who  are  mathe- 
maticians, have  learned  the  "language  of  finite  element,"  and  it  is 
evident  in  their  narrative.  However,  their  treatment  does  assume  that 
readers  are  comfortable  with  many  of  the  functional  analysis  concepts 
presented  earlier  in  this  report.  Most  engineers  who  practice  finite 
element  may  not,  however,  be  knowledgeable  in  this  area  of  mathematics. 
Nevertheless,  the  purpose  of  this  section  and  a main  point  of  this 
report  can  be  satisfactorily  understood  without  thoroughly  understanding 
functional  analysis.  The  authors  contend  that  shape  functions  are  a 
root-cause  of  the  success  of  the  finite  element  method.  This  contention 
is  developed  here  in  the  context  of  the  earlier  introductions  to  functional 
analysis  and  the  Rayleigh-Ri tz  method. 

It  was  stated  earlier  that  the  main  reason  for  the  success  of  the 
finite  element  method  was  its  computational  ease  and  accuracy  and  that 

this  was  a result  of  using  certain  kinds  of  functions  called  shape 
M 

functions  for  the  0^^  in  the  Ritz  process.  This  will  be  shown  in  the 
following  examples  by  using  three  different  kinds  of  functions  for  the 
solution  of  a boundary  value  problem,  and,  in  each  case,  taking  the 
computation  as  far  as  Equation  7. 


*G.  Strang  and  G.  J.  Fix.  An  analysis  of  the  finite  element  method. 
Englewood  Cliffs,  N.J.,  Prentice-Hall,  Inc.,  1973. 
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The  statement  of  the  illustrative  problem  is  as  follows: 


L u = f, 

where 


and 

f = X. 

The  boundary  conditions  are  u(0)  = u(l)  = 0,  and  the  domain  is  = |0,lj. 
We  assume  a solution  of  the  form 


M 

u 


M 

1 

k=l 


Case  1.  Let  the  Ritz  functions  be  the  eigenfunctions  for  the  given 
differential  equation;  i.e., 

d),  = sin  knx. 

k 

These  functions  are  shown  in  Figure  6 and  are  seen  to  satisfy  the  given 
boundary  conditions. 
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Figure  ( 

i).  Eigenfunctions  used  as  Ritz 

f unct  1 ons 

Substituting  L,  f, 

^ J' 

and  (J)|^ 

into  Equation  7,  we  get 

M 

1 

k=l 

' 

a,  / 1— 

0 dx 

2 (si" 

jnx)  + sin  jnx|  sin  knx 

dx  = <<1 

j 

= 1,2,... 

,M. 

This  equation  reduces  successively  as  follows, 

M 

2 

^k  1 - 

(j")^ 

1 sin  jnx  sin  knx  dx  = 

k=l 

0 

j 

= 1,2,... 

,M. 

or 

11 

- 

M 

1 

J sin  jnx  sin  knx  dx  = 

<<t'j  ,x> 

( 


Invoking  orthogonality  of  the  eigenfunctions  ,<.  0 . ,(I>.  > = 6 , the  above 

J J K 

equation  reduces  to 


M - (jn)^l  ^ a 6 = <0,,x>  j = 


k=l 


k jk 


or 


1 - (jn)^l  a.  = <0^.x>  j = 1,2 M. 


That  is.  Equation  7 reduces  to  the  following  system  of  equations  in  the 

unknown  a . 

J 


^.2 

1 - n 

"l 

<'J',  .x> 

1 - 4 

1 - 9 

a 2 

^02.><> 

"3 

- 

' 03,x> 

, ■ ..2  2 

1 - M n 

Thus,  the  use  of  eigenfunctions  for  the  shape  functions  results  in 
a diagona 1 coefficient  matrix.  The  computational  ease  of  solving  for  a^ 
is  optimal  in  this  case.  Rut  the  eigenfunctions  are  not,  of  course, 
generally  available  a priori. 

Note  also  that  the  approximate  solution 


M 

u 


1 r— r sin  knx 

k=l  1 - k'^Tx'^ 


can  be  thought  of  as  the  projection  of  the  true  solution, 


u = 2 r— r sin  knx, 

k=l  1 - k n 


onto  the  subspace  M spanned  by  the  first  !i  eigenfunctions. 


25 


or 


^ ^1,  J ^(l-x)  - 2jx‘^  * + x-^(l-x)|  x*'{l-x)dx 

k=l  0 

= < x-^(  1 -x)  ,x  > j = 1 ,2, . . . ,M 


Entries  in  the  first  row  of  the  coefficient  matrix,  obtained  by  setting 
j = I . are 


1 

J I -2  + x(l-x)l  x(l-x)  dx 

b 


/ (-2  ♦ x(J-x)|x^  (l-x)dx,... 

0 


J |-2  + x(l-x)|x^  (l-x)dx 


Elements  of  the  second  row,  obtained  by  setting  j = 2,  are; 


r |2(l-x)  - ^x  + x^(l-x)|x  (l-x)dx 
0 


J"  |2(l-x)  - 4x  + x^(l-x)Ix^  (l-x)dx,... 
0 


/ (2(l-x)  - 4x  + x^(l-x)|x^  {l-x)dx 
0 


Thus,  the  coefficient  matrix  is  not  diagonal,  and  with  a little  more 
study  it  becomes  apparent  that  it  is  full.  For  example,  if  we  take 
M = 2 and  write  out  the  linear  system  we  get 


3/10  3/20 


3/20 


3/20 

h 

1/12 

13/103 

1"2 

1/20 

Solving  this  system  yields,  = 71/369,  an<l  = 7/41.  Therefore,  the 
approximate  solution  for  M = 2 is, 

u'''  = 71/369  x(l-x)  + 7/41  x^(l-x). 

M 

Case  3.  Let  (l>|^  be  the  finite  element  hat-shaped  functions  defined 
as  foil ows , 


0 

1/h  X - (k  - 1 ) 
- 1/h  X t (k  + 1 ) 
0 


0 ^ X 5 ( k - 1 ) h 
( k - I ) h S X ^ kh 
kh§x$(k+  l)h 
( k + 1 ) h ^ X 5 1 


Three  consecut i ve  'shape  functions  are  graphed  in  Figure  8,  where  it  can 
be  seen  that  they  satisfy  the  boundary  conditions.  Further,  they  are 
continuous  functions  over  the  entire  region  |0,1|.  They  do  not  differ 
in  this  respect  from  the  eigenfunctions  and  polynomial  functions  which 
were  employed  in  the  previous  two  cases. 
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Figure  8.  Finite  element  shape  functions  used 
as  Ritz  functions. 


They  do  differ  in  that  at  x = kh , for  example,  one  function  equals 
unity,  while  all  others  equal  zero.  Such  points  are  termed  node  points, 
and  there  the  undetermined  coefficients  become  the  value  of  the  dependent 
variable  u(kh).  Substituting  the  shape  functions  into  Equation  7 we 
get. 


2 a.  i;  (0^  + 0^)  0^  dxl  = <.0”,x>  j = 1,2,...,M. 

k=l  0 J J 

In  this  case,  however,  0^  = 0,  and  therefore  this  equation  reduces  to, 

M ) 

I a.  IJ  0”  0”  dxl  = <0”,x>  j = 1,2,. ...M. 
k=l  0 J J 

We  note  that  the  product  0^  0”  will  be  zero  unless  |j  - k|  S 1.  This 
can  best  be  seen  from  the  above  sketches  of  the  shape  functions.  For 
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example,  the  product  of  0.  and  0.  , is  nonzero,  but  the  product  of  0 

K 1 k 

and  0j^^2  zero.  Thus  the  system  of  equations  for  Case  3 is  tridiaRonal 


as  fol lows , 


a” 

0,.0, 


a” 

02.01 


>1.02 

2 ’ *^2 


[03.02 


’2 ’*^3 
*3 ’‘*'3 


[<4 


- 

"1 

<0”.’<> 

"2 

<02 

^3 

II 

<03. x> 

The  primary  observation  is  that  the  system  is  almost  diagonal;  or, 
alternatively,  orthogonality  almost  exists.  Computationally,  the  system 
of  equations  resulting  from  the  use  of  finite  element  shape  functions  is 
preferable  to  the  system  resulting  from  the  use  of  Ritz  functions  (smooth 
polynomials).  Surprisingly,  the  only  difference  between  the  Ritz  and 


the  finite  element  method  is  in  the  choice  of  the  continuous  functions 

.M 


for  use  in  the  approximate  solution 


M 

2 


k=l 


.M 


In  each  of  the  three  cases  the  0.  are  defined  continuously  over 

M ^ 

the  entire  region  ft  = |0,1]  and  0^^  e Hj^.  Ifowever,  the  differences  are 


important  and  are  summarized  as  follows: 


.M 


(a)  The  finite  element  functions  0|^  (Case  3)  are  zero  everywhere 


except  over  a subdomain  or  element.  This  reduces  the  coupling 
in  the  linear  system  given  by  Equation  7. 
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(b)  The  finite  element  functions  are  defined  such  that  they  are 

unity  at  X = jh,  j = 1,2,. ..,M,  that  is  at  the  node  points.  This 

N 

assigns  physical  significance  to  the  aj^.  That  is,  a^^  = u (kh), 
and  these  are  the  values  of  the  dependent  variable  at  the  kth 
node  points. 


SUMMARY 

The  problem  at  hand  is  generally  one  of  finding  an  approximate 
solution  to  a given  boundary  value  problem;  the  solution  must  satisfy  a 
governing  differential  equation  and  its  associated  boundary  conditions. 

The  process  of  finding  an  approximate  solution  can  be  discussed  in  a 
heuristic  way,  by  relying  on  geometrical  notions.  Though  we  actually 
search  for  solutions  within  mathematically  complex  subspaces  of  continuous 
functions,  often  from  a practitioner's  point  of  view,  significant  insight 
is  gained  by  limiting  thought  to  Euclidian  space.  It  is  helpful  to 
imagine  that  the  "true”  solution  is  a vector  imbedded  in  three-dimensional 
space,  but  that  the  search  for  an  approximate  solution  vector  is  unfor- 
tunately limited  to  some  lesser  subspace  (say,  a plane).  The  projection 
theorem  provides  the  interpretation  that  the  best  approximate  solution 
within  that  restriction  is  an  orthogonal  projection  of  the  true  solution 
vector  onto  the  lesser  subspace.  The  Ritz  (or,  more  generally,  the 
Galerkin)  method  is  the  mechanism  for  finding  that  projection. 

The  error  vector  is  the  difference  between  the  true  and  approximate 
solution  vectors  and  is  normal  to  the  subspace  when  it  is  minimized.  It 
is  measured  by  the  energy  norm.  It  is  the  business  of  theoreticians  to 
obtain  estimates  of  the  energy  norm  of  the  error  vector  for  formulations 
so  that  the  relative  accuracy  and  convergence  of  alternatives  can  be 
assessed.  Accuracy  criterion  is  based  on  the  energy  norm,  and  convergence 
occurs,  as  is  often  written,  "in  the  sense  of  the  norm."  The  energy 
norm  is  a functional;  as  such,  it  is  a particular  type  of  transformation. 
Its  evaluation  yields  a real  number  when  it  is  provided  with  an  input, 


^ ■ <• 


continuous  function.  Thus,  it  transforms  an  element  of  continuous 

function  space  into  an  element  of  real  number  space.  In  this  context,  • 

the  continuous  functions  are  finite  element  shape  functions,  and  the 
energy  norm  can  be  regarded  as  a mathematical  "measuring  stick"  for  such 
functions . 

The  subspace  of  admissible  functions  in  which  an  approximate  solution 
is  sought  is  dictated  by  the  differential  operator's  order  and  the 
boundary  conditions.  It  is  constituted  of  functions  having  continuous 
derivatives  up  to  the  ^rder  of  the  operator  and  that  also  satisfy  all 
the  boundary  conditions,  plus  some  additional  functions.  These  additional, 
admissible  functions  are  defined  by  an  extension  of  the  field  of  definition 
for  the  operator.  They  are  all  functions  that  are  obtainable  as  limits 
of  sequences  of  admissible  functions  from  the  first  group,  the  field  of 
definition.  A significant  practical  benefit  requires  mentioning  here. 

Often,  these  limit  functions  possess  lower  order  differentiation  than 
the  functions  constituting  the  field  of  definition.  Because  of  this, 
they  are  easier  to  construct  and  simpler  to  use  in  subsequent  computations. 

Further,  due  to  their  reduced  order  of  differentiation,  they  do  not,  and 
need  not,  satisfy  boundary  conditions  which  generally  involve  higher 
order  derivatives  on  the  dependent  variable;  boundary  conditions  that 
they  do  not  satisfy  are  the  natural  boundary  conditions.  Those  boundary 
conditions  that  they  do  satisfy  are  the  essential  boundary  conditions. 

Construction  of  functions  from  the  field  of  definition  is  most  often 
prohibitively  difficult  and/or  they  may  be  too  costly  to  compute  with. 

However,  if  they  are  available  and  economical,  their  use  is  preferable 
and  recommended  on  grounds  of  increased  accuracy. 

Finite  element  shape  functions  are  continuous  over  the  entire 
region  and  are  no  different  than  classical  Ritz  functions  in  this  regard. 

They  do  differ  in  that  they  are  piecewise  continuous,  each  being  defined 
as  zero  everywhere  except  over  a subdomain  (element)  of  the  region.  On 
the  other  hand,  classical  Ritz  functions  are  generally  more  smooth  and  > 

nonzero  everywhere  over  the  region.  A system  of  linear,  algebraic 
equations  that  results  from  using  piecewise  continuous  functions  is 


solved  more  easily  than  a system  that  results  from  Ritz  functions.  This 
is  because  they  are  characteristically  banded.  The  bandedness  reflects 
local  coupling  among  the  equations  written  for  individual  subdomains  in 
proximity,  whereas  global  coupling  and  full  matrices  are  inherent  with 
the  Ritz  functions.  In  either  case,  the  undetermined  coefficients  in 
the  Ritz  process  are  found  from  solution  of  the  linear  system  of  equations. 
They  have  a more  useful  interpretation,  though,  when  finite  element 
shape  functions  are  used.  In  this  case,  the  coefficients  are  the  values 
of  the  dependent  variable  at  node  points,  and  are  therefore  directly 
useable  data.  That  , they  are  often  discrete  values  of  the  solution 
sought.  In  contrast,  the  coefficients  of  classical  Ritz  functions 
cannot  be  interpreted  meaningfully.  Useable  data  are  obtainable  only 
with  further  manipulation  of  the  coefficients  after  their  numerical 
values  are  found. 


The  finite  element  method,  in  large  measure,  owes  its  success  to 
piecewise  continuous  shape  functions.  It  remains  fundamentally  a Ritz 
(more  generally,  Galerkin)  approximation  process,  but  because  shape 
functions  are  simple  and  amenable  with  the  digital  computer,  the  finite 
element  method  has  transcended  the  classical  method  of  Ritz  in  application 
and  popularity.  Nevertheless,  it  is  instructive  to  understand  the  Ritz 
process  as  a method  for  minimizing  a functional.  The  function,  within  a 
prescribed  space  of  admissible  functions,  that  minimizes  the  functional 
is  found  by  the  Ritz  process  and  is  also  an  approximate  solution  to  the 
boundary  value  problem.  A certain  amount  of  mathematics  from  functional 
analysis  is  required  to  appreciate  the  Ritz  process,  and  the  finite 
element  method  as  well.  It  begins  with  the  definition  of  linear  vector 
spaces  and  proceeds  through  to  the  definition  of  Hilbert  spaces.  The 


f 'iiathematics  can  be  presented  for  practitioners  by  relying,  where  possible, 

L 

; on  geometric  interpretation  in  more  familiar  Euclidean  space. 
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